Pattern Recognition and Artificial Intelligence (模式识别与人工智能)
2018, Vol. 31, Issue 10: 887-898    DOI: 10.16451/j.cnki.issn1003-6059.201810003
Papers and Reports
Cross-Language Query Post-Translation Expansion Based on Matrix-Weighted Association Rules
HUANG Mingxuan (黄名选)1,2, JIANG Caoqing (蒋曹清)1,2, HE Donglei (何冬蕾)1,2
1. Guangxi Key Laboratory Cultivation Base of Cross-Border E-commerce Intelligent Information Processing, Guangxi University of Finance and Economics, Nanning 530003
2. School of Information and Statistics, Guangxi University of Finance and Economics, Nanning 530003

Keywords: matrix-weighted association pattern; association rule; query expansion; cross-language information retrieval

Abstract

First, a method for computing matrix-weighted itemset support is proposed, together with a matrix-weighted association pattern mining algorithm for cross-language query expansion. Building on this, a cross-language query post-translation expansion algorithm based on matrix-weighted association rule mining is presented. An initial cross-language retrieval is performed with the aid of machine translation to obtain the top initially retrieved documents (TIRDs), and relevance feedback documents (RFDs) are obtained from the TIRDs through user relevance judgments. Matrix-weighted frequent itemsets containing original query terms are mined from the RFDs by computing support, and association rules containing original query terms are extracted from the frequent itemsets under a confidence-interest evaluation framework. The consequents or antecedents of these rules serve as expansion terms, and the confidence and interest of each rule measure the importance of its expansion terms, which completes the post-translation expansion. Experiments on the NTCIR-5 CLIR standard test set show that the proposed algorithm effectively improves cross-language query expansion performance and benefits cross-language retrieval with long queries, and that post-translation consequent expansion outperforms antecedent expansion.
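The pipeline summarized in the abstract (weighted support, then confidence and interest filtering, then consequent terms as expansions) can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything concrete here is an assumption made for illustration: the `support` definition (mean item weight per matching document, averaged over all documents), the lift-style `interest`, the function names, the thresholds, and the toy feedback documents. It should not be read as the authors' algorithm.

```python
def support(itemset, docs):
    """Assumed weighted support (not the paper's formula): for each document
    containing every term of the itemset, take the mean weight of those terms;
    sum over documents and divide by the number of documents."""
    if not docs:
        return 0.0
    total = 0.0
    for doc in docs:
        if all(term in doc for term in itemset):
            total += sum(doc[term] for term in itemset) / len(itemset)
    return total / len(docs)


def expansion_terms(docs, query_terms, min_sup, min_conf, min_int):
    """Mine rules q -> t with a query term q as antecedent and return the
    consequents t that pass all thresholds, scored by rule confidence."""
    if not docs:
        return {}
    # Candidate expansion terms: every feedback term that is not a query term.
    vocab = set().union(*docs) - set(query_terms)
    scores = {}
    for q in query_terms:
        sup_q = support((q,), docs)
        if sup_q == 0:
            continue
        for t in vocab:
            sup_qt = support((q, t), docs)
            sup_t = support((t,), docs)
            if sup_qt < min_sup or sup_t == 0:
                continue  # {q, t} is not frequent enough
            conf = sup_qt / sup_q
            # Assumed lift-style interest measure.
            interest = sup_qt / (sup_q * sup_t)
            if conf >= min_conf and interest >= min_int:
                # Using confidence alone as the term weight is an assumption.
                scores[t] = max(scores.get(t, 0.0), conf)
    return scores


# Hypothetical relevance feedback documents: term -> weight within the document.
docs = [
    {"retrieval": 0.8, "query": 0.6, "expansion": 0.4},
    {"retrieval": 0.5, "query": 0.7},
    {"query": 0.9, "expansion": 0.6},
    {"mining": 0.3, "rule": 0.5},
]
scores = expansion_terms(docs, ["query"], min_sup=0.1, min_conf=0.5, min_int=1.0)
for term, weight in sorted(scores.items()):
    print(term, round(weight, 3))
```

In this toy data, only terms that co-occur with the query term in the feedback documents can become expansion terms; terms such as "mining" never form a frequent itemset with "query" and are dropped. Antecedent expansion would mirror this with rules of the form t -> q.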

Received: 2018-05-08
CLC number: TP 311
Funding: Supported by the National Natural Science Foundation of China (No. 61762006, 61662003, 61262028)

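About the authors:
HUANG Mingxuan (corresponding author), master's degree, professor. Research interests: data mining, information retrieval, and machine learning. E-mail: mingxh05@163.com
JIANG Caoqing, Ph.D., professor. Research interests: formal methods and data mining. E-mail: jcqng@163.com
HE Donglei, master's degree, teaching assistant. Research interests: English language and literature, and computational linguistics. E-mail: smiley1128@163.com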
Cite this article:
HUANG Mingxuan, JIANG Caoqing, HE Donglei. Cross Language Query Post-Translation Expansion Based on Matrix-Weighted Association Rules. Pattern Recognition and Artificial Intelligence, 2018, 31(10): 887-898.
Link to this article:
http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.201810003 or http://manu46.magtech.com.cn/Jweb_prai/CN/Y2018/V31/I10/887